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METHOD FOR COMPUTING AND USING FUTURE COSTING DATA IN SIGNAL 

ROUTING 


FIELD OF THE INVENTION 

[ 0001 ] The present invention relates to programmable logic 
devices (PLDs), and more particularly to a method for routing 
signals in programmable logic devices. 

BACKGROUND OF THE INVENTION 

[ 0002 ] A programmable logic device (PLD) is a digital 
circuit implementation platform that can be configured by the 
end user to implement a logic circuit of their choice. The 
two most common types of PLDs are field-programmable gate 
arrays (FPGAs) and complex programmable logic devices 
(CPLDs) . Many PLD vendors provide computed-aided design tools 
to help their customers to implement logic designs in their 
PLDs. One type of PLD, the FPGA, is used as an example to 
illustrate the present invention. It should be noted that, in 
addition to PLDs, the present invention is applicable to 
other integrated circuit implementation technologies, for 
example, application specific integrated circuits (ASICs) , 
mask-programmed gate arrays and standard cell devices. 

[ 0003 ] In a typical architecture, an FPGA includes an 
array of configurable logic blocks (CLBs) surrounded by 
programmable input/output blocks (lOBs) . The CLBs and IOBs 
are interconnected by a hierarchy of programmable routing 
resources, which are comprised of metal wire segments and 
programmable routing switches, also referred to as 
programmable interconnection points (PIPs) . These CLBs, 

IOBs, and programmable routing resources are customized by 
loading a configuration bitstream into the FPGA. State-of-the 
art FPGAs contain tens of thousands of CLBs. For such 
devices, the task of establishing the required 
. interconnections between the primitive cells inside a CLB and 
between the CLBs themselves becomes so onerous that it can 
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only be accomplished with the assistance of computed-aided 
design tools. Accordingly, the manufacturers of FPGAs , 
including the assignee hereof, Xilinx, Inc., have developed 
place and route software tools which may be used by end 
customers to implement their respective designs in their 
FPGAs . 

[ 0004 ] The time to execute conventional routing software 
(also called a router) can be very long, e.g.,. many hours. A 
typical design needs numerous iterations in testing and 
modification, and each iteration requires running the routing 
software. As a result, design cycles are affected by the 
execution time of the routing software. Consequently, there 
is a need to improve the execution time of routing software. 

SUMMARY OF THE INVENTION 

[ 0005 ] "Future costing" is a well-known method used to 
improve the run-time required to route signals in integrated 
circuits . The present invention involves a new method for 
determining and using future cost in routing. To the router, 
the integrated circuit interconnect is modeled using a 
routing graph, containing a set of nodes and directed edges. 
The nodes in the graph represent conductors in the integrated 
circuit. Edges are present between nodes that correspond to 
conductors that can be electrically connected to one another. 
In one embodiment of the present invention, a two-dimensional 
table is used to store future cost data. The entries in the 
table are indexed by x and y distances between nodes in the 
routing graph. To populate the table with future cost 
values, pre-routing is performed from a specially chosen 
source node to other nodes in the routing graph. At each node 
visited during pre-routing, both the cumulative routing cost 
and the x and y distances from the source node are computed. 
If there is no entry in the table that corresponds to the 
specific x, y-distance, the cumulative routing cost is 
entered into the corresponding position in the table. If the 
table contains existing future cost data corresponding to 
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this distance, the table entry is set equal to the cumulative 
routing cost if the cumulative routing cost is less than the 
value of the existing future cost data. Otherwise, the 
existing future cost remains. The normal routing of a user's 
signals commences after the pre-routing is completed. During 
routing, values from the future cost table are used to reduce 
the run-time needed for routing a design's signals. 

[ 0006 ] The above summary of the present invention is not 
intended to describe each disclosed embodiment of the present 
invention. The figures and detailed description that follow 
provide additional example embodiments and aspects of the 
present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[ 0007 ] Fig. 1 is a flow chart showing an exemplary routing 
process of the present invention. 

[ 0008 ] Fig. 2 is a flow chart showing an exemplary process 
that can be used to calculate the future cost in accordance 
with the present invention. 

[ 0009 ] Fig. 3 shows an exemplary future cost table of the 
present invention. 

[ 0010 ] Fig. 4 is a flow chart showing an exemplary process 
that can be used to route a connection between a signal 
source pin and a signal load pin using the future cost 
generated in accordance with the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[ 0011 ] The present invention relates to a new method for 
routing signals. In the following description, numerous 
specific details are set forth in order to provide a more 
thorough understanding of the present invention. However, it 
will be apparent to one skilled in the art that the present 
invention may be practiced without these specific details. 

In other instances, well-known features have not been 
described in detail in order to avoid obscuring the present 
invention. 


3 



X- 1018 US 


PATENT 


[0012] A widely used algorithm used for routing signals in 
PLD devices is a variant form of the well-known maze router, 
as described in C. Y. Lee, "An Algorithm for Path Connections 
and its Applications," in IRE Trans, on Electronic Computers, 
Vol . EC=10, 1961, pp. 346-365. This technique is general and 
adapts very well to some PLD interconnect architectures, such 
as the FPGA routing fabric. From the router's perspective, 
the integrated circuit interconnect as modeled by a "routing 
graph" , containing a set of nodes and directed edges . The 
nodes in the routing graph represent conductors in the 
integrated circuit. Edges are present between nodes that 
correspond to conductors that can be electrically connected 
to one another. For example, in FPGA devices, edges are 
present between nodes corresponding to conductors that may be 
connected to each other through a programmable routing 
switch. A signal in a design is a set of pins that must be 
connected together electrically. A signal is generally 
comprised of a source pin and one or more load pins. The 
pins on a signal correspond to specific nodes in the routing 
graph. To route a signal, the router identifies paths 
between the signal's pin nodes in the routing graph. The maze 
router algorithm searches for a path between a signal source 
pin and load pins by performing a breadth first search on the 
routing graph. Nodes in the routing graph are assigned costs, 
corresponding to characteristics of their corresponding 
conductors (described below). The maze router's goal is to 
find the minimum cost path between a signal's source pin and 
its load pin(s). In general, the maze router is effective in 
finding high-quality solutions. Its main disadvantage is poor 
run-time performance. 

[0013] To improve run-time performance, the concept of 
"future costing" was introduced, a description of it can be 
found in Soukup, "Fast Maze Router," Proceedings of Design 
Automation Conference 1978, pp. 100-102. The basic idea in 
future costing is to use a new cost value (the future cost) 
to speed up the maze router by pruning and reducing its 
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search space while maintaining the quality of its solution. 
Future costing works in the following way: as the maze router 
searches for a path in the routing graph between a signal's 
source pin and one of its load pins, it considers using 
various nodes from the routing graph in the path to the load 
pin. As mentioned above, each of these nodes has a certain 
intrinsic routing cost, which reflects the delay of the node, 
its length, capacitance or other criteria. To this intrinsic 
cost, a future cost is added. The future cost for a node is 
a lower bound on the cost of the complete path from the node 
to the target pin being routed. Consequently, the nodes that 
have low future costs are likely to be the ones used in low- 
cost paths to the load pin being routed. Future costing can 
be thought of as guiding the maze router towards low-cost 
routing solutions. By exploring favorable paths first and 
postponing others, it effectively reduces the search space of 
the maze router. One key aspect of developing a future cost 
function is to ensure that it does not result in the pruning 
of high-quality routing solutions and thus damaging the 
router's quality of result. This condition is met when the 
future cost is an accurate lower bound on the path cost from 
a node to the target being routed. A problem that arises 
with the use of future cost is that there is no systematic 
way to determine the future cost function that specifies the 
future cost values used during routing. The development of a 
future cost function for a router has historically required a 
certain amount of "black magic". Many engineering hours need 
to be spent in pouring over device routing architectures and 
developing heuristics and experimental parameters in order to 
arrive at the future cost function. For each device family 
and speed grade, there is a set of device specific future 
cost functions that have to be designed and developed from 
scratch. Once in place, these future cost functions are 
effective in reducing the overall router run-time, usually by 
a significant factor. The downside of the conventional method 
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of developing such functions is that the science is not 
exact, and it is extremely consuming in engineering time. 
[ 0014 ] One aspect of the present invention is a new future 
costing approach. This approach not only provides similar or 
better run-time improvements as prior approaches, but it is 
generic and is not device specific. Once this new future cost 
mechanism is in place, all device families can leverage from 
and use the same machinery to reap significant run-time 
benefits . 

[ 0015 ] Fig. 1 is a flow chart of an exemplary routing 
process 100 of the present invention. In step 102, a two- 
dimensional array (which can be implemented as a simple 
table, called herein the future cost table) is set up to 
store future cost data. In step 104, all the future cost data 
that will be used in subsequent steps is generated. In step 
106, process 100 determines whether all the signals have been 
routed. If all have been routed, process 100 terminates. If 
signals remain to be routed, the next signal is obtained 
(step 110). In step 112, the future cost data (obtained from 
step 104) is used to route a connection between the selected 
signal's source pin its load pins. Process 100 then returns 
to step 106. 

[ 0016 ] One novel aspect of the present invention is how 
the future cost values are calculated. In the present 
invention, these values are generated dynamically, and they 
accurately represent the low-bound estimated cost to complete 
a route from a given node to a target pin node. In one 
embodiment, to populate the values of the future cost table, 
pre-routing on an empty device is performed without any 
signal obstructions. The table (mentioned in step 102) is 
used to record the minimum costs required to route from a 
pre-defined source to all nodes in the device. 

[ 0017 ] Fig. 2 shows an exemplary process 130 that can be 
used to calculate the future cost in accordance with the 
present invention. Each node in the PLD is assigned a two- 
dimensional coordinate (X and Y) . In this embodiment, it is 
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assumed that each node (e.g., node N) has a cost. The cost is 
designated as Cost(N) . It should be noted that the present 
invention is applicable to both resource, delay, and other 
routing costs. Further, node N is also associated with a 
cumulative routing cost, Cul_R_Cost (N) , representing the sum 
of the individual node costs on a path from the pre-defined 
source node to the node, N. The formula for computing this 
cumulative cost will be described below. 

[ 0018 ] In step 132, a source node (S) is selected. In 
applying this method to an FPGA, a CLB output located in the 
corner of a FPGA can be selected as the source. The intrinsic 
routing cost of node S is calculated, and is designated 
Cost(S) in process 130. In step 134, node S is added to a 
priority queue (W) . The cumulative routing cost of node S, 
designated as Cul_R_Cost (S) , is initially set equal to 
Cost(S) . In step 136, process 130 determines whether queue W 
is empty. If the queue is empty, process 130 terminates. If 
the queue is not empty, process 130 obtains a node (M) from 
queue W that has the lowest cumulative routing cost. This 
node M is now marked as "visited" (step 138) . When step 138 
is first performed, source node S is the only node in queue 
W. It is automatically treated as node M in this iteration. 

In step 140, process 130 obtains an unvisited node (K) that 
neighbors node M in the routing graph (there is an edge 
between nodes K and M in the routing graph) . In step 142, 
node K is added to queue W. This node is now selectable in 
the next iteration of step 138. The cumulative routing cost 
of node K is set equal to the sum of the cumulative routing 
cost of node M, Cul_R_Cost (M) , and the routing cost, Cost(K), 
of node K. This cumulative routing cost will be used in the 
computation of the cumulative routing cost of node K's 
neighboring nodes, if node K is selected in a subsequent 
iteration of step 138. In step 144, process 130 computes the 
Manhattan distance between nodes K and S. Two distance values 
(delta X and delta Y) are obtained, one for the x-dimension 
and one of the y-dimension. These x and y distance values are 
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used to index a value in the future cost table. In step 146, 
a cost value is entered into the table at a location 
determined by this coordinate. This cost value is equal to 
the minimum of the cumulative routing cost of node K (see 
above in connection with step 142) and the existing future 
cost value in the table. In step 148, process 130 determines 
whether all the neighbors of node M have been considered. If 
not all the neighbors have been considered, process 130 jumps 
back to step 140 and obtains another unvisited node that 
neighbors node M. If all the neighbors have been considered, 
process 130 jumps back to step 136. At the end of process 
130, each entry of the future cost table contains a numerical 
value that represents a lower bound on the cumulative routing 
cost required to route a specific distance. 

[ 0019 ] An exemplary future cost table 160 is shown in Fig. 
3. Future cost data are entered into various locations inside 
table 160, such as locations 162-165. 

[ 0020 ] In summary, the minimum future cost values are 
stored and organized in a two-dimensional array such that the 
indices represent the vertical and horizontal Manhattan 
distances between two nodes. In order to query the future 
cost value for a target that is M horizontal units and N 
vertical units away from the current node, one could simply 
look up and retrieve the value located at array location (M, 
N ) . It is important to consider the amount of run-time in 
executing process 130 of Fig. 2 because it may be significant 
relative to the overall router run-time required to route all 
user signals. In the above-described preferred embodiment, a 
single "expansion" is used to find and record the minimum 
cost paths from the pre-defined source to each node in the 
device. Each node in the routing graph is visited at most 
once in such an expansion procedure. In another embodiment, 
the future cost table need not be computed at router run- 
time. The table could be pre-computed using process 130 and 
its contents stored on disk. When the router is invoked, it 
would simply read the table from the disk, prior to 
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coiranencing to route signals. Note that this table may be re- 
generated whenever the underlying integrated circuit 
experiences changes in its routing connectivity, or changes 
in its routing delays, or when the costs assigned to nodes in 
the routing graph change. 

[ 0021 ] A process 200 for routing connections using the 
future cost of the present invention is shown in Fig. 4. Note 
that process 200 provides a detailed description of step 112 
of Fig. 1. In step 202, the source node S corresponding to 
the source node or pin of the signal being routed is located. 
(The signal being routed is identified in step 110 of Fig. 

1.) The routing cost of this node is calculated, and is 
designated Cost(S) in process 200. In step 204, node S is 
added to a priority queue W. The cumulative routing cost, 
Cul_R_Cost (S) , is initially set equal to Cost(S). In step 
208, process 200 determines whether queue W is empty. If the 
queue is empty, process 200 terminates. If the queue is not 
empty, process 200 obtains the next node in the routing, node 
(N) , from queue W (step 210) . In this embodiment, node N has 
the lowest cumulative routing cost among all the nodes in 
queue W.. When step 210 is first performed, source node S is 
the only node in queue W. It is automatically treated as the 
node N. Process 200 is used for routing from the source pin 
of a net to a given target load pin on the net. The target 
load pin has a corresponding target node in the routing 
graph. In step 212, process 200 determines whether node N is 
the target node. If this is the target node, the minimum cost 
routing path can be found by tracing back from this target 
node to the source node S. If this is not the target node, a 
neighbor node (T) of node N is obtained (step 216) . Node T 
can be an arbitrary neighbor node because, as explained 
below, all the neighbor nodes will be considered. The 
Manhattan distance (in the X and Y directions) between node T 
and the target node is calculated. These two distance values 
are used as indices to locate a corresponding entry in the 
future cost table. The routing cost of node T, Cul_R_Cost (T) , 


9 



X- 10 18 US PATENT 

is set equal to the sum of the cumulative routing cost of 
node N, Cul_R_Cost (N) , the routing cost of T, Cost(T), and 
the corresponding entry in the future cost table (step 218). 
Node T is then added to priority queue W (step 220) . This 
node is now selectable in the next iteration of step 210. The 
cumulative routing cost Cul_JR_Cost (T) will be used in the 
computation of the cumulative routing cost of its neighboring 
nodes, if T is selected in a subsequent iteration of step 
210. In step 222, process 200 determines whether all the 
neighbor nodes of node N have been considered. If the answer 
is negative, process 200 returns to step 216 (obtain another 
neighbor node of node N) . If the answer is positive, process 
200 returns to step 210 (obtain another node in queue W) . 
[ 0022 ] A different embodiment for generating a future cost 
table is now described. In order to reduce or limit the run- 
times of generating the future cost table associated with 
large devices, the search space of process 130 on the device 
can be bounded. Pre-defined maximum horizontal and vertical 
limits, labeled as Max_X and Max_Y , are assigned. This means 
that the recorded future cost table will contain entries up 
to Max_x and Max_Y distances from the source. If step 218 of 
process 200 encounters targets that lay outside of these 
distance limits, future cost values can be estimated using 
the entries located in the table, as explained below. 

[ 0023 ] In order to facilitate the estimation, memory 
locations 170, shown in Fig. 3, can be used to store minimum 
cost future cost slopes for the x and y dimensions. In this 
embodiment, the cost slopes are equal to the quotient of 
dividing the future cost by the corresponding Manhattan 
distance. When a future cost data value is stored in a future 
cost table (step 146 of process 130) , the cost slope is 
computed by dividing the current cost with the current x,y- 
Manhattan distances. A computed cost slope for the x or y 
dimension will replace its corresponding value in memory 
location 170 if the computed value is less than the existing 
value. As a result, at the end of process 130, memory 
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location 170 stores the current minimum cost slope for both 
the x and y dimensions. A future cost value for a node 
outside of the distance limit (Max_X, Max_Y) can be computed 
by multiplying the x, y-Manhattan distances from that node to 
the target with the minimum cost slope values in memory 
location 170. In essence, this method provides a conservative 
lower bound estimation based on known future cost values. 
[ 0024 ] To further increase the accuracy of the future cost 
values, different types of device nodes can be considered 
separately. For example, in an FPGA environment, distinctions 
can be made wire segments of different lengths, for example, 
long lines, hex lines, double lines, etc. Also, distinctions 
can be made between different target load pin types, for 
example, logic block input pins, clock pins, etc. Separate 
future cost tables can be generated for each type of 
resource, thus improving the accuracy of the future cost 
function and further pruning the search space of the router. 
[ 0025 ] This new invention yields improvements in router 
run-time that are similar or better than prior methods. In 
addition, the computation of the future cost function is 
generic and is de-coupled from the device-specific structure 
of an integrated circuit. Both PLDs and other types of 
integrated circuit implementation technologies can benefit 
from this invention. 

[ 0026 ] It can be seen from the above description that a 
novel routing method has been disclosed. Those having skill 
in the relevant arts of the invention will now perceive 
various modifications and additions which may be made as a 
result of the disclosure herein. Accordingly, all such 
modifications and additions are deemed to be within the scope 
of the invention, which is to be limited only by the appended 
claims and their equivalents . 
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